JIPS (Çѱ¹Á¤º¸Ã³¸®ÇÐȸ)
Current Result Document :
ÇѱÛÁ¦¸ñ(Korean Title) |
A Text Similarity Measurement Method Based on Singular Value Decomposition and Semantic Relevance |
¿µ¹®Á¦¸ñ(English Title) |
A Text Similarity Measurement Method Based on Singular Value Decomposition and Semantic Relevance |
ÀúÀÚ(Author) |
Xu Li
Chunlong Yao
Fenglong Fan
Xiaoqiang Yu
|
¿ø¹®¼ö·Ïó(Citation) |
VOL 13 NO. 04 PP. 0863 ~ 0875 (2017. 08) |
Çѱ۳»¿ë (Korean Abstract) |
|
¿µ¹®³»¿ë (English Abstract) |
The traditional text similarity measurement methods based on word frequency vector ignore the semantic relationships between words, which has become the obstacle to text similarity calculation, together with the high-dimensionality and sparsity of document vector. To address the problems, the improved singular value decomposition is used to reduce dimensionality and remove noises of the text representation model. The optimal number of singular values is analyzed and the semantic relevance between words can be calculated in constructed semantic space. An inverted index construction algorithm and the similarity definitions between vectors are proposed to calculate the similarity between two documents on the semantic level. The experimental results on benchmark corpus demonstrate that the proposed method promotes the evaluation metrics of F-measure.
|
Å°¿öµå(Keyword) |
Natural Language Processing
Semantic Relevance
Singular Value Decomposition
Text Representation
Text Similarity Measurement
|
ÆÄÀÏ÷ºÎ |
PDF ´Ù¿î·Îµå
|